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The questions and problems of the formation of knowledge bases of 
intelligent man-machine decision support systems are considered. 
The neuron-fuzzy model used in the work is described. The need for 
increasing the efficiency of the neuron-fuzzy model in the formation of 
knowledge bases is being updated. The task is to develop methods and 
algorithms for presetting and optimizing the parameters of a fuzzy neural 
network. To solve difficult formalized tasks, it is necessary to develop 
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ES developers are constantly faced with the problems of “extraction” and 
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: ormalization of knowledge, as well as the search for new ways to obtain it. 
Logic To do this, use the extraction, acquisition and formation of knowledge. 
Network : Currently, the formation of knowledge bases is relevant for the creation of 
Programming hybrid technologies - fuzzy neural networks that combine the advantages of 
System neural network models and fuzzy systems. The analysis of the efficiency of 
the fuzzy neural network carried out in the work showed that the quality of 
training of the NN largely depends on the choice of the number of fuzzy 
granules for input drugs. In addition, to use fuzzy information formalized by 
the mathematical apparatus of fuzzy logic, procedures are required for 
selecting optimal forms and presetting the parameters of the corresponding 
membership functions (MF). 
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1. INTRODUCTION 

Modern information systems for analyzing large amounts of information or managing complex 
processes cannot be imagined without elements of artificial intelligence [1-3]. Data mining methods allow 
building effective models of diagnostics, forecasting, decision making in many subject areas of human 
activity [4-10]. 

Such models are used in a wide class of intelligent information systems, especially in expert systems 
(ES), the main element of which is the knowledge base - a model represented by many systematized rules 
that describe patterns in the subject area under consideration. Therefore, the design of knowledge bases is an 
important task in the development of expert systems [11, 12]. 

The analysis of the capabilities of the neuro-fuzzy knowledge base formation model showed that the 
quality of training of a fuzzy neural network (NN) largely depends on the choice of the number of fuzzy 
granules for input linguistic variables (LV) [13-17]. In addition, to use fuzzy information formalized by the 
mathematical apparatus of fuzzy logic, procedures are needed for selecting optimal forms and initializing the 
parameters of the corresponding membership functions. For these reasons, to improve the accuracy of 
approximation of experimental data by fuzzy production rules, it is necessary to automatically select the 
optimal number of fuzzy granules of input linguistic variables of a fuzzy neural network and the 
corresponding forms and parameters of their membership functions. 
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2. METHODS 

Often, experts decide the problem of choosing the number of fuzzy gradations. For purely 
psychological reasons, they choose an odd number of values of a linguistic variable, for example, 3, 5, 7. 
Moreover, this choice is subjective and does not always reflect the real picture [18-20]. 

When using automatic methods, the choice of the number of values of a linguistic variable is based 
on the criterion of its optimality [21]. For this, methods of cluster data analysis are often used. To date, 
a large number of different algorithms have been developed and used to solve the clustering problem. 
The assumptions of traditional clustering algorithms determine the following factors that do not allow their 
full application in the developed methodology: 

a) A priori assumptions about the properties of clusters, the principles of combining objects, or setting the 
number of clusters are unacceptable; 

b) It is unacceptable to construct an algorithm only on the relation of points to the centers of clusters, 
and not on the basis of the relative positions of the points. 

c) The absence of an understandable linguistic interpretation of partitions is unacceptable. 

In this regard, to effectively solve the problem of clustering the values of the input parameters of a 
fuzzy neural network, the development of a special algorithm that takes into account the relationship between 
individual data points without being tied to the center of the cluster is relevant. 

The developed clustering algorithm is based on a fuzzy relationship apparatus using the concepts of 
a-tolerance and o-quasi-equivalence relations, which, respectively, make sense of pairwise comparison of 
data samples relative to a given sample and intergroup data comparison. To build a family of a-quasi- 
equivalence relations, a sequence of data comparison methods is used, each of which is based on the previous 
one and is more suitable for directly solving the fuzzy clustering problem [22]. 

a) Comparison by distance between data samples - pairwise comparison of data samples. 

b) Comparison using normal measures of similarity - one by one comparison of all data samples with each 
of the samples. The result is fuzzy sets of data samples that are close to each of the data samples. 

c) Comparison using relative similarity measures - pairwise comparison of data samples relative to a given 
sample. The result is the degree to which each pair of data samples belongs to the corresponding 
similarity relationship. 

d) Comparison of data samples using the ratio of a-tolerance on many data samples - the similarity of any 
two data samples relative to all other samples. The result is the degree to which each pair of data 
samples belongs to the a-tolerance ratio. 

e) Comparison using the ratio of a-quasi-equivalence and the scale of the ratio of a-quasi-equivalence is 
an intergroup similarity of data. The result is the degree to which each pair of data samples belongs to 
the ratio of a-quasi-equivalence. 

The clustering method in the algorithm is the use of a family of equivalence relations, each of which 
is obtained by switching from the a-quasi-equivalence relation to the equivalence relation in the classical 
sense using the corresponding level of a-quasi-equivalence relation from the scale of a-quasi-equivalence 
relation. Those data samples that, in accordance with the a-quasi-equivalence relation, have similarities 
exceeding the indicated level are equivalent, the rest are nonequivalent. 

Cluster partitioning is fuzzy - corresponding to the presence of a fuzzy data relationship. 
The number of partitions is finite and is determined by the power (the number of ratio levels) of the ratio 
scale of a-quasi-equivalence. Each concrete partition by clusters corresponds to a partition of the set of data 
samples into equivalence classes at a certain level of a-quasi-equivalence. 

Let the values of the input parameters of the neural network xi are given on a nonempty set X and 
the clustered object represents only one feature. We introduce the basic concepts and definitions. A normal 
measure of similarity in distance x, with x; is a measure that reaches its boundary values on the set X with the 
membership function, defined as follows. 


3. RESULTS AND DISCUSSION 

When training a fuzzy neural network, a study was conducted of the effectiveness of the technique. 
The quality of NN training in changing its output error was evaluated for various approaches to choosing the 
number of gradations of input drugs and the forms of their membership functions. Using a technique based 
on the developed algorithms, the quality of NN training was not inferior, and in many cases, superior to the 
quality of network training, in which the choice of the number of gradations of input neurons and forms of 
AF was determined subjectively by an expert. The considered example showed that when using the proposed 
methodology, the quality of training of the NN significantly improved. This indicates the practical feasibility 
and advisability of using this technique. Consider the problem of choosing the optimal forms of membership 
functions of the input LP values. The membership functions of a fuzzy set are traditionally built on expert 
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information. There are a significant number of such methods that can be divided into two groups: direct and 
indirect. The simulation took place in the SimInTech system. [23-28] 

Examples of direct methods are the direct assignment of MF to a table, schedule, or formula. 
In indirect methods, the values of the phase transitions are chosen in such a way as to satisfy predefined 
conditions. Expert information is only the initial information for constructing a function. 

The disadvantage of both groups of methods is a large share of subjectivity. A different approach to 
the construction of phase transitions is based on the parametric identification of fuzzy models based on 
experimental "input-output" data. Using this approach removes the subjectivity of constructing functions, 
but instead requires a training sample with representative examples of "inputs - output." In addition, 
phase transitions of identical fuzzy sets in meaning are obtained differently as a result of identification of 
various “input-output” dependencies. 


4. SUMMARY 

Currently, in fuzzy modeling systems, the most common among experts are triangular, trapezoidal, 
and Gaussian membership functions. In the proposed approach, for modeling fuzzy constraints, in addition to 
the three indicated, double and double Gaussian phase transitions are also used as generalizations of the 
trapezoidal and triangular functions, respectively. 

We assume that the observed observations and the processes under study obey a certain law 
described by the mathematical model, and deviations from it are random. In this case, the least squares 
method is best for estimating the accuracy of the approximation. Using this method and evaluating the 
residual variance, we determine the shape and initialize the parameters of the phase transition that most 
accurately describes the initial fuzzy set. 


5. CONCLUSIONS 

The analysis of the effectiveness of data mining methods and strategies for obtaining knowledge for 
expert systems has been carried out in order to justify the relevance of developing new mathematical methods 
and algorithms for the automated generation of ES knowledge bases, as well as the need to increase the 
efficiency of one of such methods, the neuron-fuzzy model. 
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